Consistent Risk Estimation in Moderately High-Dimensional Linear Regression
نویسندگان
چکیده
Risk estimation is at the core of many learning systems. The importance this problem has motivated researchers to propose different schemes, such as cross validation, generalized and Bootstrap. theoretical properties estimators have been extensively studied in low-dimensional settings, where number predictors p much smaller than observations n. However, a unifying methodology accompanied with rigorous theory lacking high-dimensional settings. This paper studies risk under moderately asymptotic setting n,p → ∞ n/p δ > 1 ( fixed number), proves consistency three that successful numerical studies, i.e., leave-one-out validation (LOOCV), approximate (ALO), message passing (AMP)-based techniques. A corner stone our analysis bound we obtain on discrepancy `residuals' obtained from AMP LOOCV. connection not only enables us more refined information estimates AMP, ALO, LOOCV, but also offers an upper convergence rate each estimator.
منابع مشابه
Consistent group selection in high-dimensional linear regression.
In regression problems where covariates can be naturally grouped, the group Lasso is an attractive method for variable selection since it respects the grouping structure in the data. We study the selection and estimation properties of the group Lasso in high-dimensional settings when the number of groups exceeds the sample size. We provide sufficient conditions under which the group Lasso selec...
متن کاملPivotal Estimation in High-dimensional Regression via Linear Programming
We propose a new method of estimation in high-dimensional linear regression model. It allows for very weak distributional assumptions including heteroscedasticity, and does not require the knowledge of the variance of random errors. The method is based on linear programming only, so that its numerical implementation is faster than for previously known techniques using conic programs, and it all...
متن کاملPackage hdlm: Regression Tables for High Dimensional Linear Model Estimation
We present the R package hdlm, created to facilitate the study of high dimensional datasets. Our emphasis is on the production of regression tables and a class ‘hdlm’ for which new extensions can be easily written. We model our work on the functionality given for linear and generalized linear models from the functions lm and glm in the recommended package stats. Reasonable default options have ...
متن کاملRobust High-Dimensional Linear Regression
The effectiveness of supervised learning techniques has made them ubiquitous in research and practice. In high-dimensional settings, supervised learning commonly relies on dimensionality reduction to improve performance and identify the most important factors in predicting outcomes. However, the economic importance of learning has made it a natural target for adversarial manipulation of trainin...
متن کاملMultiple testing in high-dimensional linear regression
In many real-world statistical problems, we observe a large number of potentially explanatory variables of which a majority may be irrelevant. For this type of problem, controlling the false discovery rate (FDR) guarantees that most of the discoveries are truly explanatory and thus replicable. In this talk, we propose a new method named SLOPE to control the FDR in sparse high-dimensional linear...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Information Theory
سال: 2021
ISSN: ['0018-9448', '1557-9654']
DOI: https://doi.org/10.1109/tit.2021.3095375